Enhancing Spam Comment Detection on Social Media with Emoji Feature and Post-Comment Pairs Approach using Ensemble Methods of Machine Learning

Abstract

Every time a well-known public figure posts something on social media, it encourages many users to comment. Unfortunately, not all comments are relevant to the post. Some are spam, which can disrupt the overall flow of information. This research employed two strategies to address issues in spam text detection on social media. The first strategy was utilizing emojis, which had frequently been discarded in previous studies. In fact, social media users use emojis to convey their intentions. The second strategy stacked post-comment pairs, in contrast to systems that focused solely on comment-only data. The pairs were required to detect whether a comment was relevant (not spam) or spam based on the post context. The research used the SpamID-Pair dataset, derived from social media for Indonesian spam comment detection. After a comprehensive investigation, the emoji-text feature, post-comment pairs, and ensemble voting could boost detection performance (in terms of accuracy and F1). Adding manual features also improved performance. Based on the experiment, the best stand-alone methods were SVM (RBF kernel) and the soft voting method, with average ...
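
The three ideas named in the abstract (keeping emojis as features, stacking the post with the comment, and soft voting) can be illustrated with a minimal sketch. This is not the paper's implementation: the emoji ranges, the `post:`/`comment:` prefixes, the label names, and the probability values are all assumptions chosen for illustration.

```python
import re
from typing import Dict, List, Sequence

# Assumption: these two ranges cover common emoji and pictographs,
# not the complete Unicode emoji set.
TOKEN_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]|\\w+")


def tokenize(text: str) -> List[str]:
    """Return lowercase word tokens, keeping each emoji as its own token."""
    return TOKEN_RE.findall(text.lower())


def pair_tokens(post: str, comment: str) -> List[str]:
    """Stack a post and a comment into one token list.

    The side prefixes (hypothetical) let a downstream model tell
    post context apart from comment content.
    """
    post_side = ["post:" + tok for tok in tokenize(post)]
    comment_side = ["comment:" + tok for tok in tokenize(comment)]
    return post_side + comment_side


def soft_vote(probas: Sequence[Dict[str, float]]) -> str:
    """Average per-class probabilities over classifiers, return the top label."""
    labels = list(probas[0].keys())
    avg = {lab: sum(p[lab] for p in probas) / len(probas) for lab in labels}
    return max(avg, key=avg.get)


# Illustrative (made-up) outputs of three classifiers for one comment.
votes = [
    {"spam": 0.95, "not_spam": 0.05},
    {"spam": 0.45, "not_spam": 0.55},
    {"spam": 0.45, "not_spam": 0.55},
]
print(tokenize("Cek promo 🔥🔥 di bio!"))
print(pair_tokens("Liburan 😍", "Mantap 😍"))
print(soft_vote(votes))
```

In the made-up `votes` above, a majority (hard) vote would pick `not_spam` because two of the three classifiers lean that way, while soft voting picks `spam` because the averaged probability favors it. That difference is the reason soft voting needs classifiers that output probabilities rather than only labels.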

Similar Resources

SMS Spam Detection using Machine Learning Approach

Over recent years, as the popularity of mobile phone devices has increased, Short Message Service (SMS) has grown into a multi-billion dollar industry. At the same time, a reduction in the cost of messaging services has resulted in growth in unsolicited commercial advertisements (spam) being sent to mobile phones. In parts of Asia, up to 30% of text messages were spam in 2012. Lack of real data...

Fault Detection of Anti-friction Bearing using Ensemble Machine Learning Methods

The Anti-Friction Bearing (AFB) is a very important machine component, and its unscheduled failure causes malfunction in a wide range of rotating machinery, which results in unexpected downtime and economic loss. In this paper, ensemble machine learning techniques are demonstrated for the detection of different AFB faults. Initially, statistical features were extracted from temporal vibratio...

Detection and Filtering Spam using Feature Selection and Learning Machine Methods

In recent years, email has turned out to be among the pervasive and cost-effective tools of communication. In the meantime, spam emails have reduced their popularity and become offensive to all individuals and users applying this capability. Email filtering is the first solution to cope with this challenge. This is developed as a special type of text classification. A variety of methods includi...

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people to interact and express their thoughts or feelings about different subjects. However, some users may write offensive tweets to others via social media, which is known as cyberbullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "tweets" via one of the machine l...

Email Spam Detection A Machine Learning Approach

Machine learning is a branch of artificial intelligence concerned with the creation and study of systems that can learn from data. A machine learning system could be trained to distinguish between spam and non-spam (ham) emails. We aim to analyze current methods in machine learning to identify the best techniques to use in content-based spam filtering.

Journal

Journal title: IEEE Access

Year: 2023

ISSN: 2169-3536

DOI: https://doi.org/10.1109/access.2023.3299853